Smart Paper Notes

A direct time-of-flight image sensor with in-pixel surface detection and dynamic vision

Istvan Gyongy , Ahmet T. Erdogan , Neale A. W. Dutton , Germán Mora Martín , Alistair Gorman , Hanning Mai , Francesco Mattioli Della Rocca , Robert K. Henderson

Category: Computer Vision

2022-09-23

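3D flash LiDAR is an alternative to traditional scanning LiDAR systems, promising precise depth imaging in a compact form factor with no moving parts, for applications such as self-driving cars, robotics, and augmented reality (AR). Typically implemented using single-photon, direct time-of-flight (dToF) receivers in image sensor format, the devices can be hindered by the large number of photon events that must be processed and compressed in outdoor scenes, which also limits scalability to larger arrays. Here we present a 64x32 pixel (256x128 SPAD) dToF imager that overcomes these limitations by using pixels with embedded histogramming, which lock onto and track the return signal. This considerably reduces the size of output data frames, enabling maximum frame rates in the 10 kfps range, or 100 kfps for direct depth readings. The sensor can selectively read out pixels that detect surfaces or sense motion, reducing power consumption and off-chip processing requirements. We demonstrate the sensor's application in mid-range LiDAR.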
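The ranging principle behind a histogramming dToF pixel can be sketched in a few lines. The example below is only an illustration, not the sensor's in-pixel lock-and-track logic: it bins photon arrival times, takes the fullest bin as the round-trip time, and converts it to depth with d = c·t/2. The function name, the 1 ns bin width, and the sample timestamps are all assumptions made for this sketch.

```python
from collections import Counter

SPEED_OF_LIGHT = 299_792_458.0  # metres per second


def depth_from_timestamps(timestamps_ns, bin_width_ns=1.0):
    """Estimate depth in metres from photon arrival times in nanoseconds.

    Hypothetical helper: histogram the arrival times, pick the fullest bin
    (earliest bin wins a tie), and treat its centre as the round-trip time.
    """
    if not timestamps_ns:
        raise ValueError("no photon events")
    histogram = Counter(int(t // bin_width_ns) for t in timestamps_ns)
    peak_bin, _ = max(histogram.items(), key=lambda kv: (kv[1], -kv[0]))
    round_trip_s = (peak_bin + 0.5) * bin_width_ns * 1e-9  # bin centre, in seconds
    return SPEED_OF_LIGHT * round_trip_s / 2.0  # light travels out and back


# Four signal photons near 66 ns plus two background events (made-up data).
events = [66.2, 66.7, 66.9, 12.3, 140.8, 66.4]
depth_m = depth_from_timestamps(events)
```

Reading out only the peak position rather than every photon event is what shrinks the output frame; the full histogram never has to leave the pixel.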

Translated by Google Translate

Pixels2Pose: Super-Resolution Time-of-Flight Imaging for 3D Pose Estimation

Alice Ruget , Max Tyler , Germán Mora Martín , Stirling Scholes , Feng Zhu , Istvan Gyongy , Brent Hearn , Steve McLaughlin , Abderrahim Halimi , Jonathan Leach

Category: Computer Vision

2021-10-11

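Single-photon-sensitive depth sensors are increasingly used in next-generation electronics for human pose and gesture recognition. However, cost-effective sensors typically have low spatial resolution, which restricts their use to basic motion identification and simple object detection. Here we perform a temporal-to-spatial mapping that drastically increases the resolution of a simple time-of-flight sensor, from an initial resolution of $4 \times 4$ pixels to depth images of $32 \times 32$ pixels. The output depth maps can then be used for accurate three-dimensional human pose estimation of multiple people. We develop a new explainable framework that provides intuition about how our network uses its input data and gives key information about the relevant parameters. Our work greatly expands the use cases of simple time-of-flight sensors and opens up promising possibilities for future application to other types of sensors with similar data types, i.e. radar and sonar.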


The Undesirable Dependence on Frequency of Gender Bias Metrics Based on Word Embeddings

Francisco Valentini , Germán Rosati , Diego Fernandez Slezak , Edgar Altszyler

Category: Natural Language Processing | Artificial Intelligence

2023-01-02

Numerous works use word embedding-based metrics to quantify societal biases and stereotypes in texts. Recent studies have found that word embeddings can capture semantic similarity but may be affected by word frequency. In this work we study the effect of frequency when measuring female vs. male gender bias with word embedding-based bias quantification methods. We find that Skip-gram with negative sampling and GloVe tend to detect male bias in high frequency words, while GloVe tends to return female bias in low frequency words. We show these behaviors still exist when words are randomly shuffled. This proves that the frequency-based effect observed in unshuffled corpora stems from properties of the metric rather than from word associations. The effect is spurious and problematic since bias metrics should depend exclusively on word co-occurrences and not individual word frequencies. Finally, we compare these results with the ones obtained with an alternative metric based on Pointwise Mutual Information. We find that this metric does not show a clear dependence on frequency, even though it is slightly skewed towards male bias across all frequencies.
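A PMI-based bias score of the kind mentioned above can be sketched from raw co-occurrence counts. This is one common formulation and an assumption of this note, not necessarily the exact metric used in the paper: the bias of a word w is PMI(w, female context words) minus PMI(w, male context words). All function and parameter names are hypothetical.

```python
import math


def pmi(cooc_wa, count_w, count_a, total):
    """Pointwise mutual information between word w and context set A.

    cooc_wa: co-occurrences of w with words in A
    count_w: total co-occurrences involving w
    count_a: total co-occurrences involving A
    total:   total co-occurrences in the corpus
    """
    return math.log((cooc_wa * total) / (count_w * count_a))


def pmi_gender_bias(cooc_wf, cooc_wm, count_w, count_f, count_m, total):
    """Positive values lean female, negative values lean male (illustrative)."""
    return pmi(cooc_wf, count_w, count_f, total) - pmi(cooc_wm, count_w, count_m, total)


# Made-up counts: w co-occurs 30 times with female and 10 times with male contexts.
bias = pmi_gender_bias(30, 10, count_w=100, count_f=1000, count_m=1000, total=1_000_000)
# Same co-occurrence counts, but w is five times more frequent overall.
bias_frequent = pmi_gender_bias(30, 10, count_w=500, count_f=1000, count_m=1000, total=1_000_000)
```

Under this particular definition the word's own count cancels in the difference, so the score depends only on the two co-occurrence counts and the context-set totals.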


Countering Malicious Content Moderation Evasion in Online Social Networks: Simulation and Detection of Word Camouflage

Álvaro Huertas-García , Alejandro Martín , Javier Huertas Tato , David Camacho

Category: Natural Language Processing | Artificial Intelligence

2022-12-27

Content moderation is the process of screening and monitoring user-generated content online. It plays a crucial role in stopping content resulting from unacceptable behaviors such as hate speech, harassment, violence against specific groups, terrorism, racism, xenophobia, homophobia, or misogyny, to name a few, in Online Social Platforms. These platforms make use of a plethora of tools to detect and manage malicious information; however, malicious actors also improve their skills, developing strategies to surpass these barriers and continuing to spread misleading information. Twisting and camouflaging keywords are among the most used techniques to evade platform content moderation systems. In response to this recent ongoing issue, this paper presents an innovative approach to address this linguistic trend in social networks through the simulation of different content evasion techniques and a multilingual Transformer model for content evasion detection. In this way, we share with the rest of the scientific community a multilingual public tool, named "pyleetspeak", to generate/simulate in a customizable way the phenomenon of content evasion through automatic word camouflage, and a multilingual Named-Entity Recognition (NER) Transformer-based model tuned for its recognition and detection. The multilingual NER model is evaluated in different textual scenarios, detecting different types and mixtures of camouflage techniques, achieving an overall weighted F1 score of 0.8795. This article contributes significantly to countering malicious information by developing multilingual tools to simulate and detect new methods of evasion of content on social networks, making the fight against information disorders more effective.
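To make the idea of word camouflage concrete, here is a toy simulation of one technique, leetspeak character substitution applied only to chosen keywords. It does not use or reproduce the pyleetspeak API; the `camouflage` function and the substitution table are hypothetical and written only for this sketch.

```python
import re

# Illustrative substitution table (an assumption, not taken from the paper).
LEET_MAP = {"a": "4", "e": "3", "i": "1", "o": "0", "s": "5", "t": "7"}


def camouflage(text, keywords, mapping=LEET_MAP):
    """Replace mapped characters inside whole-word keyword matches only."""
    if not keywords:
        return text
    alternatives = "|".join(re.escape(k) for k in keywords)
    pattern = re.compile(r"\b(" + alternatives + r")\b", re.IGNORECASE)

    def swap(match):
        # Characters without a mapping (and their original case) are kept.
        return "".join(mapping.get(ch.lower(), ch) for ch in match.group(0))

    return pattern.sub(swap, text)


camouflaged = camouflage("Get the vaccine today", ["vaccine"])
```

A detector then has to recover the camouflaged spans, which is why the task can be framed as token-level tagging in the style of NER.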


Emergent communication enhances foraging behaviour in evolved swarms controlled by Spiking Neural Networks

Cristian Jimenez Romero , Alper Yegenoglu , Aarón Pérez Martín , Sandra Diaz-Pier , Abigail Morrison

Category: Neural and Evolutionary Computing

2022-12-16

Social insects such as ants communicate via pheromones, which allows them to coordinate their activity and solve complex tasks as a swarm, e.g. foraging for food. This behaviour was shaped through evolutionary processes. In computational models, self-coordination in swarms has been implemented using probabilistic or action rules to shape the decision of each agent and the collective behaviour. However, manually tuned decision rules may limit the behaviour of the swarm. In this work we investigate the emergence of self-coordination and communication in evolved swarms without defining any rule. We evolve a swarm of agents representing an ant colony. We use a genetic algorithm to optimize a spiking neural network (SNN) which serves as an artificial brain to control the behaviour of each agent. The goal of the colony is to find optimal ways to forage for food in the shortest amount of time. In the evolutionary phase, the ants are able to learn to collaborate by depositing pheromone near food piles and near the nest to guide their cohorts. The pheromone usage is not encoded into the network; instead, this behaviour is established through the optimization procedure. We observe that pheromone-based communication enables the ants to perform better in comparison to colonies where communication did not emerge. We assess the foraging performance by comparing the SNN-based model to a rule-based system. Our results show that the SNN-based model can complete the foraging task more efficiently in a shorter time. Our approach illustrates that even in the absence of pre-defined rules, self-coordination via pheromone emerges as a result of the network optimization. This work serves as a proof of concept for the possibility of creating complex applications utilizing SNNs as underlying architectures for multi-agent interactions where communication and self-coordination are desired.


The RPM3D project: 3D Kinematics for Remote Patient Monitoring

Alicia Fornés , Asma Bensalah , Cristina Carmona-Duarte , Jialuo Chen , Miguel A. Ferrer , Andreas Fischer , Josep Lladós , Cristina Martín , Eloy Opisso , Réjean Plamondon

Category: Artificial Intelligence

2022-12-09

This project explores the feasibility of remote patient monitoring based on the analysis of 3D movements captured with smartwatches. We base our analysis on the Kinematic Theory of Rapid Human Movement. We have validated our research in a real-case scenario for stroke rehabilitation at the Guttmann Institute (a neurorehabilitation hospital), showing promising results. Our work could have a great impact on remote healthcare applications, improving medical efficiency and reducing healthcare costs. Future steps include more clinical validation, developing multi-modal analysis architectures (analysing data from sensors, images, audio, etc.), and exploring the application of our technology to monitor other neurodegenerative diseases.


BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

Category: Natural Language Processing

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.


Knowledge Distillation for Federated Learning: a Practical Guide

Alessio Mora , Irene Tenison , Paolo Bellavista , Irina Rish

Category: Machine Learning

2022-11-09

Federated Learning (FL) enables the training of Deep Learning models without centrally collecting possibly sensitive raw data. This paves the way for stronger privacy guarantees when building predictive models. The most used algorithms for FL are parameter-averaging based schemes (e.g., Federated Averaging) that, however, have well-known limits: (i) clients must implement the same model architecture; (ii) transmitting model weights and model updates implies a high communication cost, which scales up with the number of model parameters; (iii) in the presence of non-IID data distributions, parameter-averaging aggregation schemes perform poorly due to client model drifts. Federated adaptations of regular Knowledge Distillation (KD) can solve and/or mitigate the weaknesses of parameter-averaging FL algorithms while possibly introducing other trade-offs. In this article, we provide a review of KD-based algorithms tailored for specific FL issues.
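The parameter-averaging step that these limits refer to can be sketched as a sample-weighted mean of client parameters, as in Federated Averaging. The flat-list representation of model weights and the function name are simplifying assumptions for this illustration.

```python
def federated_average(client_weights, client_sizes):
    """Average client parameter vectors, weighted by local dataset size.

    client_weights: list of equal-length lists of floats, one per client
    client_sizes:   number of local training samples per client
    """
    if len(client_weights) != len(client_sizes) or not client_weights:
        raise ValueError("need one size per client and at least one client")
    n_params = len(client_weights[0])
    if any(len(w) != n_params for w in client_weights):
        # Element-wise averaging only makes sense for identical architectures.
        raise ValueError("all clients must share the same parameter shape")
    total = sum(client_sizes)
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(n_params)
    ]


# Two clients with 1 and 3 local samples (made-up numbers).
global_weights = federated_average([[1.0, 2.0], [3.0, 4.0]], [1, 3])
```

The shape check mirrors limit (i), and the fact that every client ships a full parameter vector each round mirrors limit (ii).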


Adaptive QoS of WebRTC for Vehicular Media Communications

Ángel Martín , Daniel Mejías , Zaloa Fernández , Roberto Viola , Josu Pérez , Mikel García , Gorka Velez , Jon Montalbán , Pablo Angueira

Category: Computer Vision

2022-08-24

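Vehicles carrying sensors for onboard systems are gaining connectivity. This enables information sharing and a more comprehensive understanding of the environment. However, peer communication over public cellular networks brings multiple networking hurdles, requiring in-network systems to relay communications and connect parties that cannot connect directly. Web Real-Time Communication (WebRTC) is a good candidate for media streaming across vehicles, as it enables low-latency communication while bringing standard protocols for the security handshake, discovering public IPs, and traversing Network Address Translation (NAT) systems. However, end-to-end Quality of Service (QoS) adaptation in an infrastructure where transmission and reception are decoupled by a relay needs a mechanism to adapt the video stream to the network capacity efficiently. To this end, this paper investigates a mechanism to apply changes in resolution, frame rate, and bitrate by exploiting Real-time Transport Control Protocol (RTCP) metrics, such as bandwidth and round-trip time. The solution aims to ensure that the receiving onboard system gets relevant information in time. The impact on end-to-end throughput efficiency and reaction time of applying different adaptation approaches is analyzed in a real 5G testbed.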
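As a rough illustration of metric-driven adaptation, the sketch below picks a resolution, frame rate, and bitrate from a fixed ladder using an estimated bandwidth and round-trip time. It is not the mechanism evaluated in the paper: the ladder, the headroom factor, the RTT threshold, and all names are assumptions chosen for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VideoProfile:
    width: int
    height: int
    fps: int
    bitrate_kbps: int


# Hypothetical encoding ladder, ordered from highest to lowest quality.
LADDER = [
    VideoProfile(1920, 1080, 30, 4000),
    VideoProfile(1280, 720, 30, 2500),
    VideoProfile(854, 480, 25, 1000),
    VideoProfile(640, 360, 15, 500),
]


def select_profile(bandwidth_kbps, rtt_ms, headroom=0.8, max_rtt_ms=200.0):
    """Pick the best profile that fits the bandwidth budget.

    The budget keeps some headroom below the estimate and is halved when
    the round-trip time suggests congestion (illustrative policy only).
    """
    budget = bandwidth_kbps * headroom
    if rtt_ms > max_rtt_ms:
        budget *= 0.5
    for profile in LADDER:
        if profile.bitrate_kbps <= budget:
            return profile
    return LADDER[-1]  # nothing fits: fall back to the lowest profile


good_link = select_profile(bandwidth_kbps=6000, rtt_ms=50)
congested_link = select_profile(bandwidth_kbps=6000, rtt_ms=300)
```

In a relayed topology the sender would need these metrics for the relay-to-receiver leg as well, since its own RTCP feedback only describes the path to the relay.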


Evo* 2022 -- Late-Breaking Abstracts Volume

A. M. Mora , A. I. Esparcia-Alcázar

Category: Neural and Evolutionary Computing | Artificial Intelligence | Machine Learning

2022-08-01

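This volume contains the late-breaking abstracts submitted to the Evo* 2022 conference, held in Madrid (Spain) from 20 to 22 April. These papers present ongoing research and preliminary results investigating the application of different approaches (mainly evolutionary computation) to different problems, most of them real-world ones.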
